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Short proofs of some extremal results II 


David Conlon* Jacob Fox^ Benny Sudakov^ 


Abstract 

We prove several results from different areas of extremal combinatorics, including complete or 
partial solutions to a number of open problems. These results, coming mainly from extremal graph 
theory and Ramsey theory, have been collected together because in each case the relevant proofs 
are quite short. 


1 Introduction 

We study several questions from extremal combinatorics, a broad area of discrete mathematics which 
deals with the problem of maximizing or minimizing the cardinality of a collection of finite objects 
satisfying a certain property. The problems we consider come mainly from the areas of extremal graph 
theory and Ramsey theory. In many cases, we give complete or partial solutions to open problems 
posed by researchers in the area. 

While each of the results in this paper is interesting in its own right, the proofs are all quite short. 
Accordingly, in the spirit of Alon’s ‘Problems and results in extremal combinatorics’ papers 
and our own earlier paper m, we have chosen to combine them. We describe the results in brief 
below. For full details on each topic we refer the reader to the relevant section, each of which is 
self-contained and may be read separately from all others. 

In Section [2l we address old questions of Erdos and Hajnal [20] and Caro m concerning extremal 
problems for set mappings. In Section jS] we answer a question of Foucaud, Krivelevich and Per- 
arnau |23|, which extends an old problem of Bollobas and Erdos m, about finding large dC^.r-free 
subgraphs in graphs with a given number of edges. In Section U we show how to use the Lovasz local 
lemma to embed sparse hypergraphs in large dense hypergraphs and apply this technique to improve 
the Ramsey number of the cube and other bipartite graphs. In Section [S] we address a problem of 
Erdos and Hajnal [21] on estimating the bounds on a variant of the classical Ramsey problem and 
extend this to estimate the extremal number for small and shallow clique minors. We find a connection 
between induced Ramsey numbers and the Ruzsa-Szemeredi induced matching problem in Section [G] 
Einally, in Section [T] we prove a colored variant of the famous triangle removal lemma with reasonable 
bounds. 

All logarithms are base 2 unless otherwise stated. For the sake of clarity of presentation, we system¬ 
atically omit floor and ceiling signs whenever they are not crucial. We also do not make any serious 
attempt to optimize absolute constants in our statements and proofs. 
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2 Extremal problems for set mappings 


Consider all mappings / : — >■ (Y) with \M\ = m such that X is disjoint from f{X) for all 

X £ (^)- Let p{m,k,l) be the maximum p such that for every such mapping / there is a subset 
P C M with |P| = p where f{X) and P are disjoint for all X G (^). In 1958, Erdos and Hajnal [20] 
proved that, for all k and I, 

< p{m,k,l) < c'{mlogm)^^^, 

where c and d depend only on k and I, and asked for a more exact determination of the dependence 
of p{m, k, 1) on m. 

In 1972, in a textbook application of the probabilistic method, Spencer m proved an extension of 
Turan’s theorem to /c-uniform hypergraphs and used this to prove that p{m,k,l) > where c 

depends only on k and 1. More precisely, Spencer showed that every r-uniform hypergraph with n 
vertices and t edges contains an independent set of order at least Now consider 

the {k + l)-uniform hypergraph with m vertices and the t = edges given by X U {y} for all 
X G and y G f{X). An independent set P in this hypergraph is a set for which f{X) and P are 
disjoint for all X G (^). Since there is an independent set of order at least 
we get the desired lower bound on p{m, k, 1). 

Despite the attention the set mapping problem has received over the years, the upper bound has not 
been improved. Here we solve the Erdds-Hajnal problem when I is sufficiently large as a function 
of k, showing that p{m,k, [k — 1)!) = where the implied constants depend only on k. For 

simplicity, we first describe a construction for I = k\ and then show how to modify it to get I = (A: — 1)!. 

Theorem 2.1 For m = n^, there is a function f : such that |M| = m, X and f{X) are 

disjoint for all X G {^), and every P <Z M of order greater than k‘^n contains a k-set X such that 
f{X) is not disjoint from P. 


Proof: Let M = [n]^ and consider the function / : such that if X consists of the A:-tuples 

{xii,X 2 i-, ■ ■ ■ ,Xki) for i G [k], then f{X) consists of all /c-tuples (a:i 7 r(i), iC 27 r( 2 )> ■ ■ ■ ■:XkTT(k))i where vr is a 
permutation of [k], with the caveat that if (xiTr(i),X 2 Tr( 2 )! ■ ■ ■ is equal to an element of X or a 

previously chosen element of f{X), we instead choose an arbitrary element of M which is not equal 
to any of these elements. With this choice, f{X) is well defined and X and f{X) are disjoint. 

Let P C [n]^ with |P| > k'^n. As long as one of the hyperplanes with one fixed coordinate has at most 
k elements of P, delete those elements from P. As there are at most kn such hyperplanes, there are 
at most deleted points. Therefore, there is a remaining point p G P such that there are at least k 
other points remaining on each of the k hyperplanes containing p with one coordinate fixed. Picking 
distinct points pi,... ,pk G P which are distinct from p with pi having the same coordinate as p 
and letting X = {pi,... ,pfc}, we have p G f{X), which completes the proof. □ 

To improve the bound on I in this theorem from k\ to (k — 1)!, we need a minor modification. Indeed, 
suppose that the elements of M are ordered lexicographically (that is, order is determined by first 
comparing the first coordinates, then the second coordinates and so on) and f{X) is that subset of the 
/c-tuples (a:i 7 r(i),3^271(2)) • • •) XkTr(k)) for which 7r(l) is fixed so that the first coordinate is equal to 

the minimum of the first coordinates of the elements of X. The proof then proceeds as above, but we 
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choose p and pi to be the smallest, in the lexicographic ordering, among the remaining vertices after 
all deletions. This guarantees that pi is the first element in the set {pi,... ,pk}, though the remaining 
elements may be ordered arbitrarily. We note that Z = (/c — 1)! = 1 is best possible when k = 2. After 
this paper was written, we learned that this particular case was independently solved much earlier by 
Fiiredi [30]. It would be very interesting to determine whether I can be decreased to 1 for all k. 

A related question of Caro m (see also M) asks for an estimate on q{m, k, d), the maximum q such 
that for every mapping / ; ('^) —)• with |M| = m such that \X rif{X)\ < d for all X G there 
is a subset Q C M with \Q\ = q such that f{X) is not a subset of Q for any X G (^). This differs 
from the Erdos-Hajnal question on three counts: we take I = k] we allow X to overlap with f{X) by 
a certain controlled amount d; and we only require a subset where the image of each element is not 
contained within the subset rather than being entirely disjoint from it. If we let t = {k — d)/{2k — d— 1), 
Caro m proved that 

cm* < q{m,k,d) < c*(mlogm)*, 

where c and c' depend only on k and d. Here we partially answer a question of Caro m by removing 
the log factor from the upper bound when k = 2. 

Theorem 2.2 There exist eonstants ci and C 2 such that 
(i) q{m, 2,1) < cim^/^, 

(ii) q{m, 2,0) < C 2 m^/^. 

Proof: (i) We define a mapping of the complete graph with vertex set [m]^. li x < x' and y ^ y', we 
map the edge {{x,y), {x',y')) to {{x,y), {x,y')) and otherwise we map arbitrarily while ensuring that 
{{x,y), {x',y')) doesn’t map to itself. 

Suppose now that Q is a subset of order at least 2m +1. On every horizontal or vertical line, we delete 
the highest point which is in Q. Since we delete at most 2m points, some point q € Q must remain. 
If <7 = {x,y'), we see that there are points {x,y) and {x',y') with x < x' and y' < y which are also in 
Q. But, if e = {{x,y), {x',y')), its image is {{x,y), {x,y')), which is also in Q. 

(ii) We define a mapping of the complete graph with vertex set [m]^. If x < x', y ^ y' and z ^ z', 
we map the edge {{x,y, z),{x',y', z')) to {{x',y,z),{x',y,z')) and otherwise we map arbitrarily while 
ensuring that {{x,y,z)), {x',y',z')) is disjoint from its image. 

Suppose now that Q is a subset of order at least 3m^ + 1. On every line {{a,y,z) : 1 < a < m}, we 
remove the lowest point, while on every line {{x,b, z) : 1 < b < m}, we remove the highest point. The 
remaining set still has m? + 1 points. Therefore, there are two points which have the same x and y 
coordinate, say, {x',y,z) and {x',y,z'), where 2 < z'. Since we removed the highest point on the line 
{{x',b, z') : 1 < b < m}, there exists y' > y such that {x',y',z') is in Q. Similarly, since we removed 
the lowest point on the line {{a,y,z):l<a< m}, there exists x < x' such that (x, y, z) is in Q. But, 
if e = ((x, y, z), {x' ,y', z')), its image is {{x' ,y, z), {x' ,y, z')), which is also in Q. □ 


3 Large subgraphs without complete bipartite graphs 

Given a family of graphs TL, we let f{m,TL) be the size of the largest "H-free subgraph that can be 
found in any graph with m edges, where a graph is ^-free if it contains no graph from the family TL 
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as a subgraph. The problem of estimating f{m,T-L) was first raised by Bollobas and Erdos [18] at a 
workshop in 1966, where they asked whether every graph with m edges contains a C' 4 -free subgraph 
with edges. Erdos [19] later remarked that the answer in this case is likely based 

on an example due to Folkman and private communication from Szemeredi. 

Recently, the function f(m,T-L) was rediscovered by Foucaud, Krivelevich and Perarnau [23], who 
considered the case where T-L is the family of even cycles of length at most 2k, obtaining estimates 
that are tight up to a logarithmic factor. In this section, we address a question asked by these same 
authors and extend the Folkman^Szemeredi result by determining up to a constant factor 

when H is a. complete bipartite graph. 

Let Ky^s be the complete bipartite graph with parts of order r and s, where 2 < r < s. The following 
theorem gives a lower bound on f{m,Kr^s)- 
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Theorem 3.1 Every graph G with m edges contains a Kj-^r-free subgraph of size at least . 

To prove this theorem, we need an upper bound on the number of copies of iLr,r which can be found 
in a graph with m edges. The problem of maximizing the number of copies of a fixed graph H over 
all graphs with a given number of edges was solved by Alon [ 2 ] (and the corresponding problem for 
hypergraphs was solved by Friedgut and Kahn [29]). For our purposes, the following simpler estimate 
will suffice. 

Lemma 3.2 Every graph G with m edges contains at most 2m^ copies of K^^r- 

Proof: Note that every copy of in G contains a matching of size r. Since the number of such 

matchings is at most (™) and every matching of size r can appear in at most 2 ^ copies of ATr-.r; the 
total number of such copies is at most 2 ^(™) < 2 m’'. □ 

Using this lemma, together with a simple probabilistic argument, one can prove the required lower 
bound on /(m, AT^^g). 

Proof of Theorem I3.lt Let G be a graph with m edges. Consider a subgraph G' of G obtained by 
choosing every edge independently at random with probability p = Then the expected 

number of edges in G' is mp. Also, by Lemma 13.21 the expected number of copies of in G' is at 
most 2p^ mJ'. Delete one edge from every copy of Ky^r contained in G'. This gives a A'r,r--free subgraph 
of G, which, by linearity of expectation, has at least 

2 T’ ^ 1 ^ 

vm — 2 p’’ m’’ > —m^-m^ > —m^ 

^ ^ “2 8 -4 

edges on average. Hence, there exists a choice of G' which produces a AT^^^-free subgraph of G of size 
at least |m’'+i. □ 

We will now show that this estimate is tight when G is an appropriately chosen complete bipartite 
graph with m edges. Since the Turan number for A'^ g is not known in general, it is somewhat surprising 
that one can prove a tight bound on the size of the largest AT^^g-free subgraph in graphs with m edges. 

Theorem 3.3 Let 2 < r < s and let G be a complete bipartite graph with parts U and V, where 
\U\ = and |U| = Then G has m edges and the largest Kr^s-fr^^ subgraph of G has 

at most edges. 
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Proof: The proof is a simple application of the counting argument of Kovari, Sos and Turan m- 
Let G' be a iL^^s-free subgraph of G and let d = e{G')/\V\ be the average degree of vertices in V within 
G'. If d > s, then, by convexity, 

On the other hand, since G' is Kr^s-free we have that 

vev ^ / \ / 

This contradiction completes the proof of the theorem. □ 

These results can also be extended to fc-uniform hypergraphs, which, for brevity, we call fe-graphs. Let 
(k) 

di^r,...,r denote the complete fc-partite /c-graph with parts of order r. 

(k) 1 ‘i-l 

Theorem 3.4 Every k-graph G with m edges contains a iLr,...,r-/ree subgraph of size at least ^rn , 

where q = 


(k) 

Proof: Let G be a fc-graph with m edges. Every copy of in G contains a matching of size r 

and the number of such matchings is at most . On the other hand, every matching in G of size r 
can appear in at most {kiy copies of Kr^},^r- This implies that the total number of such copies is at 
most {k\Y{y). 

Consider a subgraph G' of G obtained by choosing every edge independently at random with prob¬ 
ability p = Then the expected number of edges in G' is mp and the expected number of 

copies of in G' is at most {kiyp^^ (fyf). Delete one edge from every copy of Kr^},^r contained in 

(k) 

G . This gives a Kf,fr-free subgraph of G with at least 


pm — {k\y p^ 


k 



> 



< 1-1 

1 


(k) 

expected edges. Hence, there exists a choice of G which produces a iLr,,,,,r-free subgraph of G of this 
size. □ 


We can again show that this estimate is tight when G is 


an appropriately chosen /c-partite /c-graph. 


k 1 

Theorem 3.5 Let k,r > 2, q = and let G be a complete k-partite k-graph with parts Ui,l < 

i—\ j 

i <k, such that \Ui\ = rrf Then G has m edges and the largest i^r,...,r-/?^ee subgraph of G has 

edges. 


This result follows from the next statement, which is proved using a somewhat involved extension of 
the counting argument used in the graph case. This technique has its origins in a paper of Erdos m- 
Throughout the proof, we use the notation (*) as a shorthand for *(* It>r-i-, thus extending 

the definition of the binomial coefficient to a convex function on all of M. 
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Proposition 3.6 Let G be a k-partite k-graph with parts Ui,l < i < k, such that \Ui\ = rf and 
with aWj^y 2 Wi\ edges. Then G contains at least ni<fc-i copies of 


Proof: We prove the result by induction on k. We may always assume that a > r + A; — 2, since 
otherwise = 0 and the result is trivial. First, suppose that k = 2 and we have a bipartite 

graph with parts Ui of order n, U 2 of order and an^ edges. Let d{S) denote the number of common 
neighbors of a subset S' in G and let D = ii(S')/(”) be the average number of common 

neighbors taken over all subsets of order r in C/i. Note that 


E <i(-s) = E 

ScUi,\S\=r X&U 2 


d{x) 

r 


> 


\U2\ = 


n 


Therefore, since a > r, D > > a and the number of copies of Kr,r in G is 



completing the proof in this case. 

Now suppose we know the statement for A: — 1. For every vertex x S C/^, let Gx be the {k — l)-partite 
{k — l)-graph which is the link of vertex x (i.e., the collection of all subsets of order A; — 1 which 
together with x form an edge of G). Let axW^Z^ \ Ui\ be the number of edges in Gx- By definition, 

Yhx ~ eL\Uk\ = \ By the induction hypothesis, each Gx contains at least ni<fe -2 

(k—1) 

copies of iLr,...,rh By convexity, the total number of such copies added over all Gx is at least 



where in the final inequality we use that o > r + A: — 2. 

For every subset S which intersects every Ui, i < k — 1, in exactly r vertices, let d{S) be the number 
of vertices x € such that x forms an edge of G together with every subset of S of order A: — 1 which 
contains one vertex from every Ui. By the above discussion, we have that 

Y,d{S)>{a-k + 2) n 

S i<k—l 



(k—1) 

that is, at least the number of copies of counted over all Gx- On the other hand, by the 

dehnition of d{S), the number of copies of Kr^},^r in G equals Since the total number of 

sets S is ni<A:-i average value of d{S) is at least a — A: + 2 and the result now follows by 

convexity. □ 
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4 Ramsey numbers and embedding large sparse hypergraphs into 
dense hypergraphs 

For a graph H, the Ramsey number r{H) is the least positive integer N such that every two-coloring of 
the edges of the complete graph Kj^ contains a monochromatic copy of H. One of the most important 
results in graph Ramsey theory is a theorem of Chvatal, Rodl, Szemeredi and Trotter [T2j which says 
that for every positive integer A there is a constant C(A) such that every graph H with n vertices and 
maximum degree A satisfies r{H) < C{A)n. That is, the Ramsey number of bounded-degree graphs 
grows linearly in the number of vertices. 

The original proof of this theorem used the regularity lemma and gives a very poor tower-type bound for 
C(A). Following improvements by Eaton [16] and Graham, Rodl and Rucihski [31], the bound C{A) < 
2 cA log A gjyg ]2 by the authors M- This is close to optimal, since Graham, Rodl and Rucihski 1311 
152] showed that there exist graphs H (even bipartite graphs) with n vertices and maximum degree A 
for which r{H) > 2^'^n. 

If we assume that H is bipartite, work of Gonlon [TB] and Fox and Sudakov [27| shows that r{H) < 
for any H with n vertices and maximum degree A. By the results of Graham, Rodl and Rucihski 
mentioned above, this is optimal up to the constant c. The bound proved by Fox and Sudakov m, 
r{H) < A2^'’'^n, remains the best known. Here we remove the A factor from this bound. 

Theorem 4.1 For every bipartite graph FI on n vertices with maximum degree A, r{H) < 2^~^^n. 

This theorem allows us to give a slight improvement on the Ramsey number of cubes. The d-cube Qd is 
the d-regular graph on 2*^ vertices whose vertex set is {0, and where two vertices are adjacent if they 
differ in exactly one coordinate. Burr and Erdos [10] conjectured that r{Qd) is linear in the number 
of vertices \Qd\- After several improvements over the trivial bound r{Qd) < ^’(IQdl) < by 

Beck |7j . Graham, Rodl and Rucmski |32j and Shi Hass], Fox and Sudakov m obtained the bound 
'i~{Qd) < d2‘^'^~^^, which is nearly quadratic in the number of vertices. This follows immediately from 
their general upper bound on the Ramsey numbers of bipartite graphs. Theorem 14.11 improves this to 
a true quadratic bound. 

Corollary 4.2 For every positive integer d, r{Qd) < 2^'^'^^. 

We note that results similar to Theorem liT] and Corollary 14.21 were proved by Lee |38| in his recent 
breakthrough work on the Ramsey numbers of degenerate graphs. However, the method he uses is 
very different to ours. To understand our approach, it will be useful to first describe the method used 
in m to prove the bound r{H) < A2‘^+®n. 

Suppose that Lf is a bipartite graph with n vertices and parts Vi and V 2 , where every vertex in Vi 
has degree at most A and every vertex in V 2 has degree at most k. The proof from m has two main 
ingredients. The first ingredient is a powerful probabilistic technique known as dependent random 
choice (see, for example, the survey |28| for a discussion of its many variants and applications) which 
allows one to find a large vertex subset 17 in a dense graph G such that almost all subsets of at most 
k vertices from U have many common neighbors. 

To prove an upper bound on the Ramsey number of iL, we take G to be the denser of the two 
monochromatic graphs which edge-partition the complete graph K]\[, so that G has edge density at 
least 1/2. We use the dependent random choice lemma to find a subset U with the property that 
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almost every subset with at most k vertices has at least n common neighbors. We then form an 
auxiliary hypergraph Q on U hy letting a subset S with at most k vertices be an edge of Q if the 
vertices of S have at least n common neighbors in G. We also define a hypergraph T-L on Vi by saying 
that a subset T with at most k vertices is an edge if there is a vertex of H (which will necessarily 
be in V 2 ) whose neighborhood is T. It is easy to show that if is a subhypergraph of Q, then H 
is a subgraph of G. Thus, to prove an upper bound on Ramsey numbers, it suffices to show that 
every sparse hypergraph is a subhypergraph of every not much larger but very dense hypergraph. An 
embedding lemma of this form is the second ingredient used in [27]. 

To state the appropriate lemma from we say that a hypergraph is down-closed if ei C 62 and 
62 £ E implies ei G E. 

Lemma 4.3 Let T-L be an n-vertex hypergraph with maximum degree A such that each edge ofTi 
has size at most k and suppose that 6 < (4A)“^. If Q is a down-closed hypergraph on vertex set U 
with N > An vertices and more than (1 — 6 ) ('^) edges of size k, then there is a copy ofH in Q. 

The proof of this lemma uses a greedy embedding process. However, we may improve it by a simple 
application of the Lovasz local lemma, which we now recall. 


Lemma 4.4 Let Ai ,..., be events in an arbitrary probability space. A directed graph D = {V,E) 
on the set of vertices V = {1,... ,n} is called a dependency digraph for the events Ai ,..., if for 
each i, I < i < n, the event Ai is mutually independent of all the events {Aj : {i,j) 0 E}. Suppose 
D = {V, E) is a dependency digraph for the above events and suppose there are real numbers xi,... ,Xn 
such that 0 < Xj < 1 and Pr[Ai] < xi 0(4 j)eE(l ~ ®i) /o’" all 1 < i < n. Then 


Pr 



> 11(1 -®*)- 

i=l 


In particular, with positive probability no event Ai holds. 


Using this result, we now improve Lemma 14.31 as follows. 


Lemma 4.5 Let TL he an n-vertex hypergraph with maximum degree A such that each edge ofLL has 
size at most k and suppose that 6 < If G is a down-closed hypergraph on vertex set U 

with N > 16re vertices and more than (1 — d) ('^) edges of size k, then there is a copy ofTL in Q. 


Proof: When k = 1, the result follows from Lemma 14.31 We may therefore assume that k >2. 
Consider a uniform random mapping / : ViTL) — )• V^G). For two vertices uv of P, consider the bad 
event Auv that f{u) = f{v). For an edge e of P, we also consider the bad event Bf, that |/(e)| = |e| 
(the vertices of e map to distinct vertices), but /(e) is not an edge of G- 

Clearly, Pr[Au^] = 1 /N. We also have Pr[i?e] < d. Indeed, suppose |e| = i. If /(e) is not an edge 
in G, then, since G is down-closed, none of the k-sets containing it are in G either. However, 

the number of pairs consisting of an edge of size k which is not in G and a subset of size I is at most 
d{k){^)- If follows that the number of £-sets which are not edges of G is at most 
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which implies that Pt[B(,] < <5. 

Provided {u',v'} and e are disjoint from {u, f}, the event is independent of and Bf,. There¬ 
fore, Auv is dependent on at most 2(n — 2 ) of the events A^i^i and at most 2 A of the events Bf,. 
Similarly, provided {u,v} and e! are disjoint from e, the event B^. is independent of A^v and i?e'- 
Therefore, Bg is dependent on at most ( 2 ) — (”" 2 ^'^^) < of th® events Auv and at most kA of the 
events Be'- 

We now apply the local lemma. For each Auy, we let the corresponding Xi be x and, for each B^, we 
let the corresponding Xi be y. Let x = -^ and y = 2 ^- As x,y < 1/2, we have 1 — x > and 
1 ~ 2 / > 4“^. Therefore, since N > 16n and k > 2, 

x(l - x)2-(l - yf^ > ±4.-^n/N^-l/k > ^ ^ 

and 

y{l - y)^^{l - xf^ > ^ 4 -i/ 24 - 4 fcnW > ^ > Pr[Be]. 

.Z/CZA 

By Lemma f4.4[ the probability that none of the bad events A^v and B^ occur is positive and, therefore, 
"H is a sub hypergraph of □ 

To prove Theorem 14.11 we need an appropriate variant of the dependent random choice lemma. The 
version we use follows easily from Lemma 2.1 of m- 

Lemma 4.6 Let G be a bipartite graph with parts Vi and V 2 of order N and at least eiV^ edges, where 
N > max( 6 n,4A:). Then there is a subset U <Z Vi with \U\ > 2~^^^e^N such that the number of 
k-sets S <ZU with |A^(S')| < n is less than . 

Combining Lemmas 14.51 and 14.61 we arrive at the following theorem. 

Theorem 4.7 Let H be a bipartite graph with n vertices such that one part has maximum degree k 
and the other part has maximum degree A. If G is a bipartite graph with edge density e and at least 
16AVfce-fc n vertices in each part, then H is a subgraph of G. 


Proof: Let N = 16A^/^e ^n. Applying Lemma IT6l with b = 16A^/^ > 2(8A:A)^/^2®”'/'^, we find a set 
\U\ with \U\ > 2~^/^e^N > 16n vertices such that the number of /c-sets S C U with N{S) < n is less 
than 2 ^+^ 6 “^. Since 2 ^+^ 6 “^ < we may apply Lemma 1431 to embed the auxiliary 

hypergraph B in the hypergraph Q (as described before Lemma l4.3p . This in turn implies that H is 
a subgraph of G. □ 

By considering the denser color in any two-coloring, this result has the following immediate corollary. 
Theorem O follows as a special case. 

Corollary 4.8 If H is a bipartite graph with n vertices such that one part has maximum degree k and 
the other part has maximum degree A, then r{H) < A^/^2^^^n. 
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5 Weakly homogeneous sequences and small minors 


In 1989, Erdos and Hajnal |21j studied an extension of the fundamental problem of estimating Ramsey 
numbers. A sequence 5i, ■ ■ ■ ,St of disjoint vertex subsets of a graph is called a weakly complete r- 
sequence of order t if each subset Si has cardinality r and, for each pair 1 < i < j < t, there is an edge 
from a vertex in Si to a vertex in Sj. Let g{r,n) be the largest t for which every graph on n vertices 
or its complement contains a weakly complete r-sequence of order t. Note that determining g{l,n) is 
simply the classical Ramsey problem, since the vertices making up Si,..., St form either a clique or 
an independent set. 

For r fixed and n sufficiently large, Erdos and Hajnal [ 2 T] proved that 

( 1/2 — Or(l))(3/2)^ logn < g{r,n) < 2^^^^r\ogn, 

where the upper bound comes from considering a random 2 -coloring of the edges of the complete graph 
on n vertices. These estimates naturally lead one to ask whether the power of r in the exponent of 
the constant factor should be 1 or 2. Improving the lower bound of Erdos and Hajnal, we answer 
this question by showing that the upper bound is much closer to the truth. Moreover, for r > 2, we 
will show that a density theorem holds, that is, every dense graph contains a large weakly complete 
r-sequence. 

A sequence Si,..., St,Ti,... ,Tt of disjoint vertex subsets of a graph is called a weakly bi-complete 
r-sequence of order t if \Si\ = \Ti\ = r for 1 < i < t and, for each pair I < i, j < t, there is an edge 
from a vertex in Si to a vertex in Tj. Given such a weakly bi-complete r-sequence, SiUTi,..., StUTt 
is clearly a weakly complete 2r-sequence. 

Theorem 5.1 Let n be sufficiently large and G be a graph with n vertices and edge density p. The 
graph G contains a weakly bi-complete r-sequence of order t if 

• p > n-V3^ ^ < 2 p-V 2 and t < 

• p> <r < ■\/p~^ log n and t < log n; 

• r > A-\/p~^ logn and t < min(pn/64-^logn, n/2r). 

In particular, G also contains a weakly complete 2r-sequence of order t. 

Note that if a graph has a weakly bi-complete r-sequence of order t, then, by arbitrarily adding 
additional vertices of the graph to the r-sets to obtain r'-sets, the graph also has a weakly bi-complete 
r'-sequence of order t for any r' satisfying r < r' < n/2t. This is useful for the third bound and for 
interpolating between the bounds. In particular, for the third bound, it will suffice to prove it for the 
case r = 4y^p“i logn. 

All three bounds on t will follow from reducing the problem to a special case of the Zarankiewicz 
problem in which we want to guarantee a Kt^t in a bipartite graph with parts of order at least pn/lQr 
and edge density at least I — e~'^ Although it is not difficult to improve our bounds by being a 
little more careful at a few points in the argument, we have chosen to present proofs which determine 
the correct behavior while remaining as simple as possible. 

By considering a random graph with edge density p, we see that the bounds in Theorem 15. 1 1 are close 
to being tight. Noting that every graph or its complement has edge density at least 1/2, we have the 
following immediate corollary of the second bound. 
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Corollary 5.2 For r fixed and n sufficiently large, 

g{r,n)>—e logn. 

Id 

Clique minors are a strengthening of weakly complete sequences, with the added constraint that the 
sets Si are required to be connected. A classical result of Mader [39] guarantees that for each t 
there is c{t) such that every graph on n vertices with at least c{t)n edges contains a clique minor of 
order t. Kostochka [351136| and Thomason [53] independently determined the order of c{t), proving 
that c{t) = Almost two decades later, Thomason [54| determined an asymptotic formula: 

c{t) = (a + o(l))t\/lnt, where a = 0.319... is a computable constant. 

In recent years, there has been a push towards extending these classical results on the extremal problem 
for graph minors to small graph minors, that is, where few vertices are used in making the minor. A 
result of Fiorini, Joret, Theis and Wood |22| says that, for each t, there are h{t) and fit) such that 
every graph with at least f{t)n edges contains a A't-minor with at most h{t)logn vertices. The log re 
factor here is necessary. Indeed, for each C there is c > 0 and an re-vertex graph with at least Cre 
edges and girth (which is defined as the length of the shortest cycle, but is also the minimum number 
of vertices in a ATs-minor) at least clog re. Fiorini et al. also conjectured that, for each e > 0, one 
may take f{t) = c{t) + e and h{t) = C{e,t). Shapira and Sudakov |l6] came close to proving this 
conjecture, showing that every re-vertex graph with at least (c(t) -|- e)re edges contains a iff-minor of 
order at most (^(e, t) log re log log re. Building upon their approach, Montgomery |l0| then solved the 
conjecture by removing the log log re factor. 

These results are all about finding clique minors in sparse graphs. Here, we study the dense case and 
find conditions on t and r such that every dense graph contains a iF^-minor where each connected set 
corresponding to a vertex of the minor contains at most r vertices. In particular, this shows that we 
may prove an analogue of Corollary 15.21 where the required subgraph is a clique minor rather than just 
a weakly complete r-sequence. 

Theorem 5.3 Let re be sufficiently large and G be a graph with re vertices and edge density p > . 

//< r < log re and t < ^e^’’ /^®®logre, then G contains a Kf-minor such that the 

connected sets corresponding to its vertices have size at most 8r. 

Minors in which the connected sets corresponding to vertices have small diameter are known as shallow 
minors. This concept was introduced in a paper by Plotkin, Rao and Smith |l2], though they attribute 
the idea to Leiserson and Toledo. Shallow minors also play a fundamental role in the work of Nesetfil 
and Ossona de Mendez on the theory of nowhere dense graphs (see their book m)- 
We mention this concept because the proof of Theorem 15.31 also gives that the connected subset 
corresponding to each vertex has diameter at most 9. A variant of this argument (using a different 
version of dependent random choice) can be used to reduce the diameter of the sets to 3, but with a 
slightly weaker bound on t. We also note that there are analogues of Theorem 15.11 when r is larger or 
smaller than the assumed range. However, the proof is the same, so we omit the details. 

We begin by proving Theorem 15.11 and then deduce Theorem 15.31 We will make use of the following 
three lemmas. 

Lemma 5.4 Let H = {Vi,V 2 ,E) be a bipartite graph with edge density p. There is a subset B <Z V 2 
with \B\ > p\V 2 \/2 such that every vertex in B has more than p|Fi|/2 neighbors in Vi. 
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Proof: Delete all vertices in V 2 of degree at most p|Vi|/2 and let B be the remaining subset of 
V 2 . The number of deleted edges is at most p|Vi||V2|/2 and hence there are at least p|Vi||V 2 | — 
p|Ti||1 ^2|/2 = PIT 1 IIV 2 I /2 remaining edges from B to Vi. As each vertex in B is in at most |l/i| edges, 
\B\>p\Vi\\V2\/2\Vi\=p\V2\/2. □ 

Lemma 5.5 Let H = {Vi,V 2 ,E) be a bipartite graph with edge density 1 — q. Then there is a subset 
B C V 2 with \B\ > 1^21/2 such that every vertex in B has more than (1 — 2g')|l/i| neighbors in Vi. 

Proof: Delete all vertices in V 2 of degree at most (1 — 2(7)|Pi| and let B be the remaining subset 
of V 2 - The number of nonedges touching the deleted vertices is at least 2(7|l/i||V2 \ B\ and at most 
q\Vi\\V 2 \. Hence, \V 2 \B\ < IP 2 I /2 and \B\ > IP 2 I/ 2 . □ 

Lemma 5.6 If H = {Vi,V 2 ,E) is a bipartite graph in which every vertex in V 2 has at least p\Vi\ 
neighbors in Vi, then there is a partition Vi = Ai U ... U into subsets of order r (so d = iPil/r/ 
such that the fraction of pairs (Aj, 6) with 6 G V 2 for which b does not have a neighbor in Ai is at most 
(I-pY < e-P^. 

Proof: Partition Vi uniformly at random into subsets Aj of size r. The probability that b has no 
neighbor in a subset Aj chosen uniformly at random is precisely the same as A, not containing any 
of the at least p\Vi\ neighbors of b in Vi, which is at most < (1 — pY■ Therefore, the 

expected fraction of pairs (Aj, b) for which b does not have a neighbor in Vi is at most (1 —pY- Hence, 
there is such a partition of Vi where the fraction of pairs (Aj, b) is at most this expected value. □ 

Proof of Theorem 15. H Let G = {V, E) be a graph on n vertices with edge density p. By considering 
a random equitable vertex partition of G, there is a vertex partition V = Pi U V 2 into parts of order 
n/2 such that the bipartite graph induced by this partition has edge density at least p. By Lemma 
Ea there is H C P 2 with \B\ > p|V2|/2 > pnjA such that every vertex in B has at least p|Pi|/2 
neighbors in Vi. By Lemma 15.61 there is a partition Vi = Ai U ... U A^ into subsets of order r (so 
d = |Pi|/r = n(2r) such that the fraction of pairs {Ai,b) with b ^ B for which b does not have a 
neighbor in Vi is at most p := (1 — p/2)'’. 

Consider the auxiliary bipartite graph X with parts {1,..., d} and B, where i is adjacent to 6 G H if 
there is at least one edge from 6 to Aj. The density of X between its parts is at least 1 — p. 

Case 1: p < 3/r. In this case, we have p < < 1 — pr/4 and hence the density of X between its 

parts is at least 1 — p > pr/4. 

Let S C {1,... ,d} consist of those vertices with at least pr|B|/8 neighbors in B. By Lemma 15.41 we 
have \S\ > prd/8 = pr(n/2r)/8 = pn/16. By Lemma 15.61 there is a partition B = BiD ... L) Bh into 
subsets of order r (so h = \B\/r > pn/^r) such that the fraction of pairs {i,j) with i £ S and j G \h] 
for which i does not have a neighbor in Bj in X (and hence Aj does not have an edge to Bj in G) is 
at most (1 — pr/8Y < 

Case 2: p > 3/r. In this case, we have p = (1 — p/2)'’ < and hence the density of X between 

its parts is at least 1 — p > 1 — 

Let S' C {1,... , d} consist of those vertices with at least (1 — 2e“^'’/^)|B| > (1 — e“^'’/‘^)|i?| neighbors 
in B. By Lemma [5.5l we have IS"! > d/2 = n/Arr. By Lemma [5.6l there is a partition B = BiD.. .CB^ 
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into subsets of order r (so h = |-B|/r > pnj^r) such that the fraction of pairs (i,j) with i S S' and 
j € [h] for which i does not have a neighbor in Bj in X (and hence Ai does not have an edge to Bj in 
G) is at most 

In either case, we obtain a bipartite graph T with parts S and [h] where (i,j) € S x [h] is an edge if 
Ai has at least one edge to Bj, the parts are of order at least N := pnjlQr and the edge density is 
1 — 5 for some 5 < Note that any Ktj in T forms a weakly bi-complete r-sequence of order t 

in G. 

If r > AYp~^ log n, then > 16 log re and this edge density is at least 1 — re“^, so T is a complete 
bipartite graph with parts of order at least pn/lQr. This gives the third desired bound. 

A classical result of Kovari, Sos and Turan m on the Zarankiewicz problem shows that if a bipartite 
graph T with parts of order at least N has density at least 1 — 5 and > {t — 1)('^), then 

the bipartite graph contains Kt^t- Notice that this inequality holds if (1 — 5 — > t/N. 

If r < p > re“^/^ and t < 4 ^ 10 ^( 32 /pA) ’ letting x = pr'^/8 < 1/2, we see that T has edge 

density at least 1 — e~^ > x/2. However, x > At/N, so T contains a Kt^t if (3^/4)* > t/N. But 

{x/Af > > 32p~^^‘^tn~^ > tlGr/pn = t/N, 

and we have shown the hrst desired bound. 

Suppose now that we are trying to obtain the second desired bound. Since < r < \/P~^ log re, 

p > and t < ^eP'^ log re, we have 5 > > re“^/® and t < Therefore, we have 

bN = 5pre/16r > pr/J'^/\tSr > n}l‘^ > t and t/N < re“^/^, so the desired inequality for the Kovari- 
Sos-Turan result holds if (1 — 25)* > re“^/^. Taking the logarithm of both sides and noting that 
log(l — 25) > —45 as 5 < e“^, we see that this holds as long as t < log re, which is a 

given assumption. This completes the proof of Theorem 15.II □ 

To prove Theorem 15.31 we combine the previous embedding technique with the following consequence 
of dependent random choice (discussed in the previous section) taken from [25] . 

Lemma 5.7 Let H = {U,V,E) be a bipartite graph with \U\ = \V\ = re/2 and at least pn^/A edges. 
Then, if p^n > 1600, there is a subset X C U with \X\ > pn/50 such that for every pair of vertices 
x,y £ X, there are at least 10~^p^n internally vertex-disjoint paths with four edges between x and y 
with internal vertices not in X. 

Proof of Theorem 15.31 Let G be a graph with edge density p on re vertices, so it has ^( 2 ) > pre^/4 
edges. By deleting vertices of degree less than pre/8 one at a time, we arrive at a subgraph G' with 
minimum degree at least pre/8 and at least prif /A — n{pn/8) = pre^/8 edges. Let v denote the number 
of vertices in G', so p^^'^n/A < v < n. 

Let H = {U,V,E) be a bipartite subgraph of G' with parts of order v/2 and at least pre^/32 edges 
such that the minimum degree of H is at least pre/32. Such a bipartite subgraph exists by considering 
a random bipartition. Note that the number of edges in H is at least pre^/32 = {pn^/8v‘^)v‘^/A. By 
Lemma EZ3 there is a subset X CL U with |X| > (pre^/8u^)u/50 > pre/400 such that for every pair of 
vertices x,y € X, there are at least 10“®(pre^/8u^)®u > 10“^'^p^re internally vertex-disjoint paths with 
four edges between x and y with internal vertices not in X. Let X' be an arbitrary subset of X of size 
exactly pre/400. 
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As every vertex in X (and hence X') has degree at least pnj32 in H, there are at least {pn/32)\X'\ 
edges between X' and V. Delete all vertices in V with fewer than (pn/32v)\X'\ neighbors in X' and 
let Z be the remaining subset of V. The number of edges between X' and Z is at least {pn/32)\X'\ — 
{pn/32v)\X'\{v/2) = pn\X'\/64. Note that \Z\ > pn/QA > \X'\. Let Z' C Z be a subset with 
\Z'\ = |X'| such that the number of edges between X' and Z' at least p\X'\\Z'\/Q4. Such a subset Z' 
exists by considering a random subset of Z of order \Z'\. Consider the bipartite graph H' between 
X' and Z'. It has 2\Z'\ vertices and at least p|X'||Z'|/64 = (p|A'|/64|Z'|)(2|Z'|)^/4 edges. Applying 
Lemma [AT] to H', there is a subset Y C Z' with |y| > (p|A'|/64|Z'|)(2|Z'|)/50 = p|A'|/1600 such 
that for every pair of vertices x,y & Y, there are at least 10“®(p|A'|/64|Z'|)®(2|Z'|) > 
internally vertex-disjoint paths with four edges between x and y with internal vertices not in Y. Since 
every vertex in Z has at least {pn/32v)\X'\ neighbors in X', the density between X' and Y is at least 
pn/32u. Let IT C X' be a subset of order |y| such that the edge density between W and Y is at least 
pn/32v > pj32. Such a subset exists by considering a random subset of X' of order \Y\. 

By Theorem 15.11 (or rather its proof, as we pass to a balanced bipartite subgraph), the bipartite graph 
between W and Y contains a weakly bi-complete r-sequence of order t with t = /^®®log|y| > 

AgP»’^/256 iQg Pqj. each part A used to make this weakly bi-complete r-sequence, fix a vertex a G A 
and consider any other vertex b G A. There are at least > 8 rt internally 

vertex disjoint paths, so we can find one of these internal paths so that the vertices have not already 
been used and add the three internal vertices of the path to connect a and b. Doing this for every 
vertex b G A\a, we get a connected set A' with at most 4r vertices. We can do this for each of the 2t 
sets A making up the weakly bi-complete r-sequence of order t. We thus obtain a Kty-rYmoi with each 
part corresponding to a vertex of order at most 4r. From a matching in the Kt^t, we get a iL^-minor 
with each part corresponding to a vertex of order at most 8r. □ 

To prove the claim that each set in the Xt-minor may be chosen to have diameter at most 9, suppose 
that A and B are two sets in the weakly bi-complete r-sequence and the union of the sets A' and B' 
formed from A and B corresponds to a vertex in the iLt-minor. If we let ab be an edge with a G A and 
b G B, we see that we could have chosen the sets A' and B' so that every vertex in A' is connected to 
a by a path of length 4 and every vertex in B' is connected to 6 by a path of length 4. Since a and b 
are joined, this clearly implies that A' U B' has diameter at most 9. 


6 Induced Ramsey numbers and Ruzsa—Szemeredi graphs 

A graph H is said to be an induced subgraph of G if V{H) C V{G) and two vertices of H are adjacent 
if and only if they are adjacent in G. We write G -^ind {Hi, H 2 ) if every red/blue-coloring of the edges 
of G contains an induced copy of Hi all of whose edges are red or an induced copy of H 2 all of whose 
edges are blue. The induced Ramsey number r[^(i{Hi, H 2 ) is the minimum N for which there exists a 
graph G on N vertices with G -^ind {Hi, H 2 ). When Hi = H 2 = H, we simply write rind(L^)- 
We will be interested in the induced Ramsey number of trees. It is easy to show that the usual Ramsey 
number for trees is linear in the number of vertices. For some trees, such as paths, it is even possible 
to show |34] that the induced Ramsey number grows linearly in the number of vertices. However, 
Fox and Sudakov |26] showed that there exist trees T for which the induced Ramsey number grows 
superlinearly. More precisely, they showed that r\^^[Ki^t)dvIt) is superlinear in t, where Mf is the 
matching with t edges. It is then sufficient if T contains both Ki^t and Mt as induced subgraphs. 

Fox and Sudakov [50] proved their result by an appeal to the regularity lemma. In this section, we 
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prove a strengthening of this result by showing that there is a close connection between Mn) 

and the celebrated Ruzsa-Szemeredi induced matching problem [H]. We say that a graph G = {V, E) 
is an {n,t)-Ruzsa-Szemeredi graph (or an (n,t)-RS graph, for short) if its edge set is the union of t 
pairwise disjoint induced matchings, each of size n. 

Theorem 6.1 If G -^md Afn); then G contains a subgraph which is an {n,t)-RS graph. 

Proof: Pick out disjoint induced matchings of size n from G until there are no more induced matchings 
of this size. If at least t induced matchings are picked out, then the union of these t induced matchings 
makes the subgraph of G which is an (n, t)-RS graph. Otherwise, we color the edges of the (fewer 
than t) induced matchings in red and the remaining edges in blue. The red graph won’t contain a red 
Ki^t as each of the (fewer than t) induced matchings contributes at most one to the degree of each 
vertex. Moreover, since we cannot pick out another disjoint induced matching, the blue graph does 
not contain an induced M„, contradicting our assumption that G —)-ind Mn). □ 

To recover the quantitative statement that rind(Ri,t, is superlinear in t, we use Theorem 16.11 to 
deduce that if G -^ind then G contains a {t,t)-KS graph. However, the work of Ruzsa and 

Szemeredi [H] shows that any such graph must have t(log* tY vertices for some positive constant c, 
where log* t is the slowly-growing function defined by log* t = 0 if t < 1 and log* t = 1 + log* (log t) 
if t > 1. Fox’s bound for the removal lemma [Mj) which we will discuss further in the next section, 
improves this estimate to * for some positive constant c. This yields the following corollary. 

Corollary 6.2 There exists a positive constant c such that rind{Ki^t, Mt) > *. 

To show that Ruzsa-Szemeredi graphs also give rise to graphs G for which G —>-ind Mn), we first 

show that we may assume our Ruzsa-Szemeredi graph is bipartite. 

Lemma 6.3 If there is an {n,t)-RS graph G = {V,E) on N vertices, then there is a bipartite {2n,t)- 
RS graph B on 2N vertices. 

Proof: Let B be the bipartite graph with parts Vi and V 2 , each a copy of V, where {u, u) € Vi x V 2 is 
an edge if and only if (u, v) is an edge of G. Each of the t induced matchings of size n in G corresponds 
to an induced matching of size 2n in B and these induced matchings make up H. □ 

It is now straightforward to show that bipartite Ruzsa-Szemeredi graphs G satisfy G —>-ind iKi,n, M^). 

Theorem 6.4 Suppose that c > 2 and G is a bipartite {cn,N/c)-RS graph on N vertices. Then 
G ^ind . 

Proof: In any red/blue-coloring of the edges of G, at least half of the edges are blue or half of them 
are red. In the former case, at least one of the induced matchings of size cn has at least half of its 
edges in color blue and, since cn/2 > n, these edges form a blue induced matching of size n. In the 
latter case, as there are nN edges and so at least nN/2 red edges, there is a vertex of red degree at 
least n. Since G is bipartite, this induces a red star Ki^n- 

As observed by Ruzsa and Szemeredi [H], a construction due to Behrend [ 8 ] allows one to show that 
there are {N /^, N /5)-RS graphs on N vertices. Applying Lemma 16.31 and Theorem 16.41 we get 
the following corollary. 
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Corollary 6.5 There exists a constant c such that rind{Ki^i, M^) < te'^Viogt_ 

7 Colored triangle removal 

The triangle removal lemma of Ruzsa and Szemeredi [H] states that for each e > 0 there is 5 > 0 
such that every graph on n vertices with at most 6 n^ triangles can be made triangle-free by removing 
at most en^ edges. That is, a graph with a subcubic number of triangles can be made triangle-free 
by removing a subquadratic number of edges. The triangle removal lemma has many applications in 
graph theory, additive combinatorics, discrete geometry and theoretical computer science. 

Until recently, the only known proof of the triangle removal lemma used Szemeredi’s regularity 
lemma [ 52 ] and gave a weak quantitative bound for namely, a tower of 2s of height polyno¬ 
mial in Recently, Fox | 21 | found a new proof which avoids the regularity lemma and improves 
the bound on to a tower of 2s of height logarithmic in It remains a major open problem to 
find a bound of constant tower height. 

It is easy to show that the triangle removal lemma is equivalent to the following statement: for each 
e > 0 there is <5 > 0 such that every tripartite graph on n vertices whose edge set can be partitioned into 
en^ triangles contains at least dn^ triangles. We present a simple proof of the following Ramsey-type 
weakening of this statement with a much better (only double-exponential) bound. 

Theorem 7.1 Every r-edge-coloring of a tripartite graph G with parts Vo,Ui,V 2 , where |Vi| = |1^| = 
n, |Vb| = cn and Vi is complete to V 2 , such that there is a collection of edge-disjoint monochro¬ 
matic triangles which cover the edges between Vi and V 2 contains at least monochromatic 

triangles. 

When > n^, two of the monochromatic triangles must share an edge between Vi and V 2 

and so we have the following corollary. 

Corollary 7.2 Suppose n > . Every r-edge-coloring of a tripartite graph G with parts 

Vq,Vi,V 2 , where |Vi| = IU 2 I = n, |Vo| = cn and Vj is complete to V 2 ) such that there is a collec¬ 
tion of edge-disjoint monochromatic triangles which cover the edges between Vj and V 2 contains a 
monochromatic diamond, i.e., an edge in two monochromatic triangles. 

One application of the triangle removal lemma, noted by Solymosi |50j , is a short proof for the corners 
theorem of Ajtai and Szemeredi [T]. This theorem states that for each e > 0 there is A^(e) such that, 
for N > A^(e), any subset S of the N x N grid with [S'! > eN'^ contains a corner, i.e., the vertices 
{x,y), {x + d,y), {x,y -V d) of an isosceles right triangle. This in turn gives a simple proof of Roth’s 
theorem |l3| that every subset of the integers of positive upper density contains a three-term arithmetic 
progression. 

The best known upper bound for the corners theorem, due to Shkredov |l9|, states that any subset 
of the N X N grid with no corner has cardinality at most V^/(log log where c > 0 is an absolute 
constant. Graham and Solymosi |33] proved a better bound for the Ramsey-type analogue of the 
corners theorem. They showed that there is c > 0 such that any coloring of the N x N grid with fewer 
than c log log V colors contains a monochromatic corner. This in turn implies a double-exponential 
bound for the van der Waerden number W (3; r) (the smallest N such that any r-coloring of the set 
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{1,2,... ,A^} contains a monochromatic three-term arithmetic progression), but this is weaker than 
the exponential bound that follows from the best quantitative estimate for Roth’s theorem I45|. 
Just as the triangle removal lemma implies the corners theorem. Corollary 17.21 implies the Graham- 
Solymosi bound on monochromatic corners in colorings of the grid. Indeed, consider an r-coloring of 
the N X N grid with N > Let Vq denote the set of 2N — 1 lines with slope —1 that each 

contain at least one of the grid points, Vi denote the set of N vertical lines that each contain N of 
the grid points and V 2 denote the set of N horizontal lines that each contain N of the grid points. 
Consider the tripartite graph G with parts Vo,Vi,V 2 , where two lines are adjacent if and only if they 
intersect in one of the points of the N x N grid, and color the edge between them the color of their 
intersection point. Note that every line in Vi intersects every line in V 2 in the grid, so Vi is complete 
to V 2 in the graph. Moreover, the three lines passing through any grid point form a monochromatic 
triangle and this collection of N'^ monochromatic triangles gives an edge-partition of G. Therefore, by 
Corollary 17.21 G contains a monochromatic diamond. This, in turn, implies that the coloring of the 
grid must contain a monochromatic corner. 

The proof of Theorem 17.11 follows from iterating the following simple lemma. 

Lemma 7.3 In every r-edge coloring of a tripartite graph G with parts Vq,Vi,V 2 , where |Vi| = IV 2 I = 
n, |Vo| = cn and the number m of edges between Vi and V 2 is at least nf sueh that there are fewer 
than 6n^ monochromatie triangles and there is a colleetion of edge-disjoint monochromatic triangles 
whieh eover the edges between Vj and V 2 , there are subsets V{ C Vi and V 2 C V 2 with \ V{\ = IR 2 I > ^ 
and a color such that the number of edges of that color between V( and Vf is at most 4Jn^. 

Proof: Let T denote a collection of edge-disjoint monochromatic triangles which cover the edges of 
G between Vi and V 2 ! so |T| = m. Partition Vq = AU B, where u G Vq is in ^ if the number of 
triangles in T containing v is at least 2 l^- number of triangles in T containing a vertex in B is 
less than |R| 2 j^ — y- Hence, at least half of the triangles in contain a vertex from A. 

Since each vertex in A is in at most n triangles from T, we have |H| > {ml2)jn > n/4. If every 

vertex in A is in at least 4(5n^ monochromatic triangles, the total number of monochromatic triangles 

is at least |H|4(5n^ > 5n^. Hence, there is a vertex u G H in fewer than 4(5n^ monochromatic triangles. 

2 

Since v £ A, there is a color, say red, such that v is in at least 2|^]7 — jtvr ~ Icr monochromatic 
red triangles from T. For i = 1,2, let V- denote those vertices in V) which are in a monochromatic 
red triangle from B with vertex v, so |H/| = IV^'I > Since v is in fewer than 4(5n^ monochromatic 
triangles and v is complete in red to V( and 1^', the number of red edges between and is at most 
4Jn^, which completes the proof. □ 

Proof of Theorem I7.lt Let /(n, r, q, s) be the minimum number of monochromatic triangles in an 
r-edge coloring of a tripartite graph G with parts Vq,Vi,V 2 , where \ Vi\ = \V 2 \>n and |Vb| < <?) such 
that the number of edges between Vi and V2 is at least |Vip — s and the edges between Vi and V2 
can be covered by edge-disjoint monochromatic triangles. In Theorem 17.11 Vi is complete to V 2 and 
|Vb| = cn, so we are trying to prove a lower bound on /(n, r, cn, 0), namely 

f(n,r,cn,0) > (4cr)~^'"^^. 

When r = 1, Lemma 17.31 implies that if G is a tripartite graph G with parts Vo,Li, V 2 , where |Tl| = 
IV 2 I = n, |Vo| = cn and e{Vi,V 2 ) > n^/2, such that G contains at most triangles and there is 
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a collection of edge-disjoint triangles in G which cover the edges between Vi and V 2 , then there are 
subsets V( C Vi and V 2 C V 2 with \V-l\ = \V 2 \ > ^ for which e{V{,V 2 ) < ASn?. In particular, if 
e(Ih, V 2 ) = v? — s, then 

A6n^ > e{V;,V^ >\Vl\\V^\ - s > (£)' - s, 
and hence 6 nP > Substituting q = cn, we get the bound 


f{n, l,q,s) 



ns 

T' 


When s < n^l2, Lemma FTSl implies that by deleting the edges of the sparsest color between V{ and 
V 2 and letting /o = /(n, r, q, s), we have 

/o > / + • 

rii = So = S and, for i > 1, s* = Si_i -h4/o/nj_i, so Si < s + 4/oX]*“o < s + 5/o/ni_i. 

After i iterations, if Sj_i < nl_il2, we get 

/o > f ini,r - i,q,Si). 


We set s = 0 and q = cn. We use the above inequalities to compute a lower bound on /o = f{n, r, cn, 0). 
Either we have Sj_i > nj_i/2 for some i < r or fo > / 1, cn, In the first case, we have 

^fo/iT'i -2 > Si-i > n?_;^/2 and, therefore, 


/o> 


ni-2n]_i nr-3nl_2 


10 


> 


10 


> (4cr) ^ n'^. 


In the second case, we have 


f ^ -f / 1 ^ \ ^r—l nSr—l 

Jo> J [rir-i, l,cn,Sr-i) > -^-> 


n'" 


64g^ 


5 n 
- 7 / 0 - 


in which case we get 


(4cr)5(2’’ 1 i)64c^ 4 nr- 2 ’ 

> f4crr2"^"n3 

2 n (4cr)5(2"-'-i)64c2 “ ^ ^ 

In either case, we have the desired inequality, which completes the proof of Theorem 17. II 


1 nr-2 
Jo > 7 - 


n 


□ 
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